Automatic Distinction of Fernando Pessoa's Heteronyms
Abstract
Text Mining has opened a vast array of possibilities for automatic information retrieval from large collections of text documents. A wide variety of themes and document types can be analyzed easily. More complex features, such as those used in Forensic Linguistics, can yield a deeper understanding of the documents and make difficult tasks such as author identification possible. In this work we explore how well simpler Text Mining approaches perform at author identification in unstructured documents, in particular at distinguishing poetic works by two of Fernando Pessoa's heteronyms: Álvaro de Campos and Ricardo Reis. Several processing options were tested and accuracies of 97% were reached, which encourages further development.
Publication date: 2015